Overview

Dataset statistics

Number of variables25
Number of observations50000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1952
Duplicate rows (%)3.9%
Total size in memory30.6 MiB
Average record size in memory642.6 B

Variable types

NUM15
CAT8
BOOL2

Reproduction

Analysis started2020-05-01 12:39:38.585101
Analysis finished2020-05-01 12:40:48.539659
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 1952 (3.9%) duplicate rows Duplicates
Name_1st has a high cardinality: 784 distinct values High cardinality
Name_2nd has a high cardinality: 784 distinct values High cardinality
Win has a high cardinality: 783 distinct values High cardinality
2nd_Win is highly correlated with 1st_WinHigh Correlation
1st_Win is highly correlated with 2nd_WinHigh Correlation

Variables

First_pokemon
Real number (ℝ≥0)

Distinct count784
Unique (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean400.49564
Minimum1
Maximum800
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum1
5-th percentile42
Q1203
median399
Q3597.25
95-th percentile760
Maximum800
Range799
Interquartile range (IQR)394.25

Descriptive statistics

Standard deviation229.5494294
Coefficient of variation (CV)0.5731633667
Kurtosis-1.188940829
Mean400.49564
Median Absolute Deviation (MAD)198.5362055
Skewness0.002992373256
Sum20024782
Variance52692.94052
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 31.5 34.5 44.5 ... 619.5 780.5 783.5 799.5 800. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
344 94 0.2%
 
163 90 0.2%
 
71 87 0.2%
 
764 85 0.2%
 
224 85 0.2%
 
657 84 0.2%
 
683 84 0.2%
 
369 84 0.2%
 
79 84 0.2%
 
584 84 0.2%
 
Other values (774) 49139 98.3%
 
ValueCountFrequency (%) 
1 70 0.1%
 
2 55 0.1%
 
3 68 0.1%
 
4 62 0.1%
 
5 50 0.1%
 
ValueCountFrequency (%) 
800 61 0.1%
 
799 75 0.1%
 
798 60 0.1%
 
797 64 0.1%
 
796 49 0.1%
 

Second_pokemon
Real number (ℝ≥0)

Distinct count784
Unique (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean403.15966
Minimum1
Maximum800
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum1
5-th percentile41
Q1207
median402
Q3602
95-th percentile760
Maximum800
Range799
Interquartile range (IQR)395

Descriptive statistics

Standard deviation230.0836441
Coefficient of variation (CV)0.5707010571
Kurtosis-1.191325361
Mean403.15966
Median Absolute Deviation (MAD)198.9962623
Skewness-0.007064501899
Sum20157983
Variance52938.4833
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 64.5 67.5 88.5 ... 657.5 780.5 783.5 799.5 800. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
758 88 0.2%
 
579 88 0.2%
 
47 87 0.2%
 
36 86 0.2%
 
522 86 0.2%
 
214 85 0.2%
 
548 84 0.2%
 
225 84 0.2%
 
401 84 0.2%
 
526 83 0.2%
 
Other values (774) 49145 98.3%
 
ValueCountFrequency (%) 
1 63 0.1%
 
2 66 0.1%
 
3 64 0.1%
 
4 63 0.1%
 
5 62 0.1%
 
ValueCountFrequency (%) 
800 60 0.1%
 
799 69 0.1%
 
798 59 0.1%
 
797 67 0.1%
 
796 56 0.1%
 

Winner
Real number (ℝ≥0)

Distinct count783
Unique (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean408.8901
Minimum1
Maximum800
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum1
5-th percentile40
Q1206
median422
Q3606
95-th percentile759
Maximum800
Range799
Interquartile range (IQR)400

Descriptive statistics

Standard deviation231.1599613
Coefficient of variation (CV)0.5653351874
Kurtosis-1.202715672
Mean408.8901
Median Absolute Deviation (MAD)200.3544718
Skewness-0.06282237031
Sum20444505
Variance53434.92772
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 6.5 9.5 10.5 13.5 ... 795.5 796.5 797.5 799.5 800. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
163 152 0.3%
 
154 136 0.3%
 
438 136 0.3%
 
428 134 0.3%
 
432 133 0.3%
 
314 133 0.3%
 
214 130 0.3%
 
394 130 0.3%
 
249 128 0.3%
 
155 127 0.3%
 
Other values (773) 48661 97.3%
 
ValueCountFrequency (%) 
1 37 0.1%
 
2 46 0.1%
 
3 89 0.2%
 
4 70 0.1%
 
5 55 0.1%
 
ValueCountFrequency (%) 
800 75 0.1%
 
799 89 0.2%
 
798 60 0.1%
 
797 116 0.2%
 
796 39 0.1%
 

1st_Win
Boolean

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
0
26399
1
23601
ValueCountFrequency (%) 
0 26399 52.8%
 
1 23601 47.2%
 

2nd_Win
Boolean

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
1
26399
0
23601
ValueCountFrequency (%) 
1 26399 52.8%
 
0 23601 47.2%
 

Who_Win
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
2nd
26399
1st
23601
ValueCountFrequency (%) 
2nd 26399 52.8%
 
1st 23601 47.2%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 4 66.7%
 
Decimal_Number 2 33.3%
 
ValueCountFrequency (%) 
Latin 4 66.7%
 
Common 2 33.3%
 
ValueCountFrequency (%) 
ASCII 6 100.0%
 

Name_1st
Categorical

HIGH CARDINALITY
Distinct count784
Unique (%)1.6%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Illumise
 
94
Mewtwo
 
90
Alakazam
 
87
Steelix
 
85
Clawitzer
 
85
Other values (779)
49559
ValueCountFrequency (%) 
Illumise 94 0.2%
 
Mewtwo 90 0.2%
 
Alakazam 87 0.2%
 
Steelix 85 0.2%
 
Clawitzer 85 0.2%
 
Druddigon 84 0.2%
 
Tentacool 84 0.2%
 
Joltik 84 0.2%
 
Seviper 84 0.2%
 
Roggenrola 84 0.2%
 
Other values (774) 49139 98.3%
 

Length

Max length25
Mean length8.3468
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 27 45.0%
 
Uppercase_Letter 26 43.3%
 
Other_Symbol 2 3.3%
 
Other_Punctuation 2 3.3%
 
Decimal_Number 1 1.7%
 
Dash_Punctuation 1 1.7%
 
Space_Separator 1 1.7%
 
ValueCountFrequency (%) 
Latin 53 88.3%
 
Common 7 11.7%
 
ValueCountFrequency (%) 
ASCII 57 96.6%
 
Misc Symbols 2 3.4%
 

Type1_1st
Categorical

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Water
7015
Normal
6038
Bug
 
4386
Grass
 
4257
Psychic
 
3728
Other values (13)
24576
ValueCountFrequency (%) 
Water 7015 14.0%
 
Normal 6038 12.1%
 
Bug 4386 8.8%
 
Grass 4257 8.5%
 
Psychic 3728 7.5%
 
Fire 3266 6.5%
 
Rock 2877 5.8%
 
Electric 2649 5.3%
 
Ghost 1985 4.0%
 
Dragon 1970 3.9%
 
Other values (8) 11829 23.7%
 

Length

Max length8
Mean length5.25124
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 17 60.7%
 
Uppercase_Letter 11 39.3%
 
ValueCountFrequency (%) 
Latin 28 100.0%
 
ValueCountFrequency (%) 
ASCII 28 100.0%
 

Type2_1st
Categorical

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Flying
6231
Water
 
4672
Psychic
 
4482
Normal
 
3980
Grass
 
3497
Other values (13)
27138
ValueCountFrequency (%) 
Flying 6231 12.5%
 
Water 4672 9.3%
 
Psychic 4482 9.0%
 
Normal 3980 8.0%
 
Grass 3497 7.0%
 
Poison 3089 6.2%
 
Ground 2983 6.0%
 
Fighting 2900 5.8%
 
Fire 2508 5.0%
 
Fairy 2331 4.7%
 
Other values (8) 13327 26.7%
 

Length

Max length8
Mean length5.60252
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 17 60.7%
 
Uppercase_Letter 11 39.3%
 
ValueCountFrequency (%) 
Latin 28 100.0%
 
ValueCountFrequency (%) 
ASCII 28 100.0%
 

HP_1st
Real number (ℝ≥0)

Distinct count92
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.06692
Minimum1
Maximum255
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum1
5-th percentile35
Q150
median65
Q380
95-th percentile110
Maximum255
Range254
Interquartile range (IQR)30

Descriptive statistics

Standard deviation25.27720041
Coefficient of variation (CV)0.365981289
Kurtosis7.081489424
Mean69.06692
Median Absolute Deviation (MAD)18.69972671
Skewness1.530787125
Sum3453346
Variance638.9368605
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 5.5 15. 22.5 29. ... 155. 167.5 180. 252.5 255. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
60 4182 8.4%
 
50 3893 7.8%
 
70 3584 7.2%
 
65 2964 5.9%
 
75 2745 5.5%
 
80 2542 5.1%
 
45 2434 4.9%
 
40 2365 4.7%
 
55 2333 4.7%
 
100 2005 4.0%
 
Other values (82) 20953 41.9%
 
ValueCountFrequency (%) 
1 60 0.1%
 
10 58 0.1%
 
20 383 0.8%
 
25 134 0.3%
 
28 65 0.1%
 
ValueCountFrequency (%) 
255 64 0.1%
 
250 52 0.1%
 
190 62 0.1%
 
170 65 0.1%
 
165 66 0.1%
 

Atk_1st
Real number (ℝ≥0)

Distinct count111
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean79.12104
Minimum5
Maximum190
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum5
5-th percentile30
Q155
median75
Q3100
95-th percentile140
Maximum190
Range185
Interquartile range (IQR)45

Descriptive statistics

Standard deviation32.69087645
Coefficient of variation (CV)0.413175515
Kurtosis0.1596992157
Mean79.12104
Median Absolute Deviation (MAD)26.01068566
Skewness0.5660703111
Sum3956052
Variance1068.693403
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5. 12.5 17.5 21. 24.5 ... 167.5 175. 182.5 187.5 190. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
65 2502 5.0%
 
80 2436 4.9%
 
100 2355 4.7%
 
50 2322 4.6%
 
85 2121 4.2%
 
75 1971 3.9%
 
60 1937 3.9%
 
55 1866 3.7%
 
70 1858 3.7%
 
90 1856 3.7%
 
Other values (101) 28776 57.6%
 
ValueCountFrequency (%) 
5 102 0.2%
 
10 209 0.4%
 
15 58 0.1%
 
20 479 1.0%
 
22 77 0.2%
 
ValueCountFrequency (%) 
190 75 0.1%
 
185 61 0.1%
 
180 193 0.4%
 
170 123 0.2%
 
165 200 0.4%
 

Def_1st
Real number (ℝ≥0)

Distinct count103
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.24854
Minimum5
Maximum230
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum5
5-th percentile35
Q150
median70
Q390
95-th percentile130
Maximum230
Range225
Interquartile range (IQR)40

Descriptive statistics

Standard deviation31.63532241
Coefficient of variation (CV)0.4260733262
Kurtosis2.776770539
Mean74.24854
Median Absolute Deviation (MAD)24.26019706
Skewness1.183282874
Sum3712427
Variance1000.793624
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5. 7.5 12.5 17.5 21.5 ... 164. 174. 182. 215. 230. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
70 3182 6.4%
 
50 3126 6.3%
 
60 2924 5.8%
 
80 2445 4.9%
 
65 2266 4.5%
 
90 2236 4.5%
 
40 2227 4.5%
 
100 2104 4.2%
 
55 2071 4.1%
 
45 1893 3.8%
 
Other values (93) 25526 51.1%
 
ValueCountFrequency (%) 
5 102 0.2%
 
10 64 0.1%
 
15 221 0.4%
 
20 267 0.5%
 
23 73 0.1%
 
ValueCountFrequency (%) 
230 214 0.4%
 
200 155 0.3%
 
184 46 0.1%
 
180 200 0.4%
 
168 57 0.1%
 

SpAtk_1st
Real number (ℝ≥0)

Distinct count104
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.8939
Minimum10
Maximum194
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum10
5-th percentile30
Q150
median65
Q395
95-th percentile132
Maximum194
Range184
Interquartile range (IQR)45

Descriptive statistics

Standard deviation32.74560808
Coefficient of variation (CV)0.4492228853
Kurtosis0.3064346895
Mean72.8939
Median Absolute Deviation (MAD)26.40625279
Skewness0.7509000578
Sum3644695
Variance1072.274848
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 10. 12.5 17.5 21.5 23.5 ... 167.5 172.5 177.5 187. 194. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
60 3140 6.3%
 
40 2973 5.9%
 
65 2838 5.7%
 
50 2533 5.1%
 
55 2244 4.5%
 
45 2072 4.1%
 
70 1900 3.8%
 
35 1795 3.6%
 
95 1769 3.5%
 
100 1719 3.4%
 
Other values (94) 27017 54.0%
 
ValueCountFrequency (%) 
10 196 0.4%
 
15 253 0.5%
 
20 446 0.9%
 
23 60 0.1%
 
24 112 0.2%
 
ValueCountFrequency (%) 
194 54 0.1%
 
180 197 0.4%
 
175 56 0.1%
 
170 210 0.4%
 
165 133 0.3%
 

SpDef_1st
Real number (ℝ≥0)

Distinct count92
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.07702
Minimum20
Maximum230
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum20
5-th percentile32
Q150
median70
Q390
95-th percentile120
Maximum230
Range210
Interquartile range (IQR)40

Descriptive statistics

Standard deviation27.91639795
Coefficient of variation (CV)0.3873134315
Kurtosis1.723017544
Mean72.07702
Median Absolute Deviation (MAD)22.14445928
Skewness0.8449180172
Sum3603851
Variance779.3252744
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 20. 21.5 24. 27.5 30.5 ... 152. 157. 180. 215. 230. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
80 3269 6.5%
 
50 3138 6.3%
 
55 2944 5.9%
 
65 2824 5.6%
 
60 2573 5.1%
 
75 2476 5.0%
 
70 2401 4.8%
 
90 2328 4.7%
 
45 2139 4.3%
 
95 1923 3.8%
 
Other values (82) 23985 48.0%
 
ValueCountFrequency (%) 
20 395 0.8%
 
23 73 0.1%
 
25 753 1.5%
 
30 1208 2.4%
 
31 67 0.1%
 
ValueCountFrequency (%) 
230 72 0.1%
 
200 72 0.1%
 
160 131 0.3%
 
154 172 0.3%
 
150 353 0.7%
 

Speed_1st
Real number (ℝ≥0)

Distinct count108
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.21442
Minimum5
Maximum180
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum5
5-th percentile25
Q145
median65
Q390
95-th percentile115
Maximum180
Range175
Interquartile range (IQR)45

Descriptive statistics

Standard deviation29.28829732
Coefficient of variation (CV)0.429356393
Kurtosis-0.2327932125
Mean68.21442
Median Absolute Deviation (MAD)24.1418225
Skewness0.3614687301
Sum3410721
Variance857.8043602
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5. 7.5 12.5 17.5 21. ... 132.5 142.5 155. 170. 180. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
50 2854 5.7%
 
60 2584 5.2%
 
65 2348 4.7%
 
30 2294 4.6%
 
70 2182 4.4%
 
80 2085 4.2%
 
100 1971 3.9%
 
90 1952 3.9%
 
40 1896 3.8%
 
55 1849 3.7%
 
Other values (98) 27985 56.0%
 
ValueCountFrequency (%) 
5 138 0.3%
 
10 191 0.4%
 
15 555 1.1%
 
20 960 1.9%
 
22 57 0.1%
 
ValueCountFrequency (%) 
180 76 0.2%
 
160 63 0.1%
 
150 254 0.5%
 
145 182 0.4%
 
140 124 0.2%
 

Name_2nd
Categorical

HIGH CARDINALITY
Distinct count784
Unique (%)1.6%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Malamar
 
88
Pidove
 
88
Zubat
 
87
Leafeon
 
86
Nidorina
 
86
Other values (779)
49565
ValueCountFrequency (%) 
Malamar 88 0.2%
 
Pidove 88 0.2%
 
Zubat 87 0.2%
 
Leafeon 86 0.2%
 
Nidorina 86 0.2%
 
Murkrow 85 0.2%
 
Phione 84 0.2%
 
Walrein 84 0.2%
 
Mega Steelix 84 0.2%
 
Porygon-Z 83 0.2%
 
Other values (774) 49145 98.3%
 

Length

Max length25
Mean length8.34312
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 27 45.0%
 
Uppercase_Letter 26 43.3%
 
Other_Symbol 2 3.3%
 
Other_Punctuation 2 3.3%
 
Decimal_Number 1 1.7%
 
Dash_Punctuation 1 1.7%
 
Space_Separator 1 1.7%
 
ValueCountFrequency (%) 
Latin 53 88.3%
 
Common 7 11.7%
 
ValueCountFrequency (%) 
ASCII 57 96.6%
 
Misc Symbols 2 3.4%
 

Type1_2nd
Categorical

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Water
7026
Normal
6060
Bug
 
4374
Grass
 
4169
Psychic
 
3592
Other values (13)
24779
ValueCountFrequency (%) 
Water 7026 14.1%
 
Normal 6060 12.1%
 
Bug 4374 8.7%
 
Grass 4169 8.3%
 
Psychic 3592 7.2%
 
Fire 3286 6.6%
 
Rock 2792 5.6%
 
Electric 2697 5.4%
 
Ground 1964 3.9%
 
Dragon 1962 3.9%
 
Other values (8) 12078 24.2%
 

Length

Max length8
Mean length5.24682
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 17 60.7%
 
Uppercase_Letter 11 39.3%
 
ValueCountFrequency (%) 
Latin 28 100.0%
 
ValueCountFrequency (%) 
ASCII 28 100.0%
 

Type2_2nd
Categorical

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Flying
6320
Water
 
4475
Psychic
 
4448
Normal
 
3932
Grass
 
3544
Other values (13)
27281
ValueCountFrequency (%) 
Flying 6320 12.6%
 
Water 4475 8.9%
 
Psychic 4448 8.9%
 
Normal 3932 7.9%
 
Grass 3544 7.1%
 
Ground 3034 6.1%
 
Poison 2927 5.9%
 
Fighting 2892 5.8%
 
Fire 2484 5.0%
 
Fairy 2381 4.8%
 
Other values (8) 13563 27.1%
 

Length

Max length8
Mean length5.60544
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 17 60.7%
 
Uppercase_Letter 11 39.3%
 
ValueCountFrequency (%) 
Latin 28 100.0%
 
ValueCountFrequency (%) 
ASCII 28 100.0%
 

HP_2nd
Real number (ℝ≥0)

Distinct count92
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.09994
Minimum1
Maximum255
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum1
5-th percentile35
Q150
median65
Q380
95-th percentile110
Maximum255
Range254
Interquartile range (IQR)30

Descriptive statistics

Standard deviation25.17010812
Coefficient of variation (CV)0.3642565843
Kurtosis6.960784184
Mean69.09994
Median Absolute Deviation (MAD)18.65516448
Skewness1.509475596
Sum3454997
Variance633.5343427
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 5.5 15. 22.5 29. ... 162.5 167.5 180. 252.5 255. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
60 4209 8.4%
 
50 3874 7.7%
 
70 3582 7.2%
 
65 2984 6.0%
 
80 2625 5.2%
 
75 2616 5.2%
 
45 2429 4.9%
 
55 2355 4.7%
 
40 2354 4.7%
 
100 1930 3.9%
 
Other values (82) 21042 42.1%
 
ValueCountFrequency (%) 
1 64 0.1%
 
10 68 0.1%
 
20 371 0.7%
 
25 117 0.2%
 
28 78 0.2%
 
ValueCountFrequency (%) 
255 64 0.1%
 
250 46 0.1%
 
190 72 0.1%
 
170 73 0.1%
 
165 66 0.1%
 

Atk_2nd
Real number (ℝ≥0)

Distinct count111
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean79.0351
Minimum5
Maximum190
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum5
5-th percentile30
Q155
median75
Q3100
95-th percentile140
Maximum190
Range185
Interquartile range (IQR)45

Descriptive statistics

Standard deviation32.41358297
Coefficient of variation (CV)0.4101163023
Kurtosis0.1621886516
Mean79.0351
Median Absolute Deviation (MAD)25.79573572
Skewness0.5530351892
Sum3951755
Variance1050.640361
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5. 12.5 17.5 21. 22.5 ... 164.5 167.5 175. 182.5 190. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
100 2465 4.9%
 
65 2432 4.9%
 
80 2432 4.9%
 
50 2271 4.5%
 
85 2123 4.2%
 
75 1958 3.9%
 
55 1937 3.9%
 
70 1914 3.8%
 
60 1900 3.8%
 
90 1822 3.6%
 
Other values (101) 28746 57.5%
 
ValueCountFrequency (%) 
5 100 0.2%
 
10 193 0.4%
 
15 65 0.1%
 
20 510 1.0%
 
22 56 0.1%
 
ValueCountFrequency (%) 
190 60 0.1%
 
185 69 0.1%
 
180 196 0.4%
 
170 97 0.2%
 
165 179 0.4%
 

Def_2nd
Real number (ℝ≥0)

Distinct count103
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.1486
Minimum5
Maximum230
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum5
5-th percentile35
Q150
median70
Q390
95-th percentile130
Maximum230
Range225
Interquartile range (IQR)40

Descriptive statistics

Standard deviation31.57830809
Coefficient of variation (CV)0.4258786827
Kurtosis2.842687063
Mean74.1486
Median Absolute Deviation (MAD)24.14912257
Skewness1.204450122
Sum3707430
Variance997.1895418
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5. 7.5 12.5 21.5 24. ... 164. 174. 182. 215. 230. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
70 3267 6.5%
 
50 3113 6.2%
 
60 2908 5.8%
 
80 2537 5.1%
 
90 2269 4.5%
 
40 2214 4.4%
 
65 2157 4.3%
 
55 2105 4.2%
 
100 2033 4.1%
 
45 1978 4.0%
 
Other values (93) 25419 50.8%
 
ValueCountFrequency (%) 
5 100 0.2%
 
10 64 0.1%
 
15 234 0.5%
 
20 249 0.5%
 
23 59 0.1%
 
ValueCountFrequency (%) 
230 214 0.4%
 
200 138 0.3%
 
184 75 0.1%
 
180 203 0.4%
 
168 64 0.1%
 

SpAtk_2nd
Real number (ℝ≥0)

Distinct count104
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.66422
Minimum10
Maximum194
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum10
5-th percentile30
Q150
median65
Q395
95-th percentile132
Maximum194
Range184
Interquartile range (IQR)45

Descriptive statistics

Standard deviation32.6445981
Coefficient of variation (CV)0.4492527148
Kurtosis0.3606340979
Mean72.66422
Median Absolute Deviation (MAD)26.29538269
Skewness0.7729417703
Sum3633211
Variance1065.669785
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 10. 12.5 17.5 21.5 23.5 ... 167.5 172.5 177.5 187. 194. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
60 3159 6.3%
 
40 3094 6.2%
 
65 2850 5.7%
 
50 2504 5.0%
 
55 2292 4.6%
 
45 1993 4.0%
 
70 1893 3.8%
 
35 1862 3.7%
 
95 1761 3.5%
 
80 1688 3.4%
 
Other values (94) 26904 53.8%
 
ValueCountFrequency (%) 
10 187 0.4%
 
15 233 0.5%
 
20 434 0.9%
 
23 70 0.1%
 
24 109 0.2%
 
ValueCountFrequency (%) 
194 71 0.1%
 
180 203 0.4%
 
175 54 0.1%
 
170 193 0.4%
 
165 115 0.2%
 

SpDef_2nd
Real number (ℝ≥0)

Distinct count92
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.01844
Minimum20
Maximum230
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum20
5-th percentile35
Q150
median70
Q390
95-th percentile120
Maximum230
Range210
Interquartile range (IQR)40

Descriptive statistics

Standard deviation27.83605283
Coefficient of variation (CV)0.3865128546
Kurtosis1.611477979
Mean72.01844
Median Absolute Deviation (MAD)22.10265705
Skewness0.8483840043
Sum3600922
Variance774.8458369
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 20. 21.5 24. 27.5 30.5 ... 152. 157. 180. 215. 230. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
80 3194 6.4%
 
50 3061 6.1%
 
55 2898 5.8%
 
65 2796 5.6%
 
60 2684 5.4%
 
70 2404 4.8%
 
75 2392 4.8%
 
45 2289 4.6%
 
90 2282 4.6%
 
85 1973 3.9%
 
Other values (82) 24027 48.1%
 
ValueCountFrequency (%) 
20 348 0.7%
 
23 59 0.1%
 
25 695 1.4%
 
30 1129 2.3%
 
31 71 0.1%
 
ValueCountFrequency (%) 
230 63 0.1%
 
200 68 0.1%
 
160 152 0.3%
 
154 179 0.4%
 
150 362 0.7%
 

Speed_2nd
Real number (ℝ≥0)

Distinct count108
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.27922
Minimum5
Maximum180
Zeros0
Zeros (%)0.0%
Memory size390.8 KiB

Quantile statistics

Minimum5
5-th percentile25
Q145
median65
Q390
95-th percentile115
Maximum180
Range175
Interquartile range (IQR)45

Descriptive statistics

Standard deviation29.10855548
Coefficient of variation (CV)0.4263164618
Kurtosis-0.265218202
Mean68.27922
Median Absolute Deviation (MAD)24.0121384
Skewness0.3450940719
Sum3413961
Variance847.3080024
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 5. 12.5 17.5 21. 22.5 ... 129. 132.5 155. 170. 180. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
50 2881 5.8%
 
60 2631 5.3%
 
65 2322 4.6%
 
30 2267 4.5%
 
70 2214 4.4%
 
80 2115 4.2%
 
100 2039 4.1%
 
90 1978 4.0%
 
55 1833 3.7%
 
40 1807 3.6%
 
Other values (98) 27913 55.8%
 
ValueCountFrequency (%) 
5 115 0.2%
 
10 184 0.4%
 
15 563 1.1%
 
20 978 2.0%
 
22 72 0.1%
 
ValueCountFrequency (%) 
180 69 0.1%
 
160 54 0.1%
 
150 225 0.4%
 
145 191 0.4%
 
140 130 0.3%
 

Win
Categorical

HIGH CARDINALITY
Distinct count783
Unique (%)1.6%
Missing0
Missing (%)0.0%
Memory size390.8 KiB
Mewtwo
 
152
Infernape
 
136
Aerodactyl
 
136
Jirachi
 
134
Slaking
 
133
Other values (778)
49309
ValueCountFrequency (%) 
Mewtwo 152 0.3%
 
Infernape 136 0.3%
 
Aerodactyl 136 0.3%
 
Jirachi 134 0.3%
 
Slaking 133 0.3%
 
Deoxys Speed Forme 133 0.3%
 
Murkrow 130 0.3%
 
Mega Absol 130 0.3%
 
Mega Houndoom 128 0.3%
 
Mega Rayquaza 127 0.3%
 
Other values (773) 48661 97.3%
 

Length

Max length25
Mean length8.78026
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 27 45.0%
 
Uppercase_Letter 26 43.3%
 
Other_Punctuation 2 3.3%
 
Other_Symbol 2 3.3%
 
Decimal_Number 1 1.7%
 
Space_Separator 1 1.7%
 
Dash_Punctuation 1 1.7%
 
ValueCountFrequency (%) 
Latin 53 88.3%
 
Common 7 11.7%
 
ValueCountFrequency (%) 
ASCII 57 96.6%
 
Misc Symbols 2 3.4%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

First_pokemonSecond_pokemonWinner1st_Win2nd_WinWho_WinName_1stType1_1stType2_1stHP_1stAtk_1stDef_1stSpAtk_1stSpDef_1stSpeed_1stName_2ndType1_2ndType2_2ndHP_2ndAtk_2ndDef_2ndSpAtk_2ndSpDef_2ndSpeed_2ndWin
0266298298012ndLarvitarRockGround506450455041NuzleafGrassDark707040604060Nuzleaf
1702701701012ndVirizionGrassFighting91907290129108TerrakionRockFighting91129907290108Terrakion
2191668668012ndTogeticFairyFlying5540858010540BeheeyemPsychicPsychic7575751259540Beheeyem
3237683683012ndSlugmaFireFire404040704020DruddigonDragonDragon7712090609048Druddigon
4151231151101stOmastarRockWater70601251157055ShuckleBugRock2010230102305Omastar
5657752657101stJoltikBugElectric504750575065Aegislash Shield FormeSteelGhost60501505015060Joltik
6192134134012ndNatuPsychicFlying405045704570JynxIcePsychic6550351159595Jynx
773545545012ndMachopFightingFighting708050353535Giratina Altered FormeGhostDragon15010012010012090Giratina Altered Forme
8220763763012ndPinecoBugBug506590353515ClauncherWaterWater505362586344Clauncher
93023131012ndWingullWaterFlying403030553085PikachuElectricElectric355540505090Pikachu

Last rows

First_pokemonSecond_pokemonWinner1st_Win2nd_WinWho_WinName_1stType1_1stType2_1stHP_1stAtk_1stDef_1stSpAtk_1stSpDef_1stSpeed_1stName_2ndType1_2ndType2_2ndHP_2ndAtk_2ndDef_2ndSpAtk_2ndSpDef_2ndSpeed_2ndWin
49990204368368012ndSkiploomGrassFlying554550456580ZangooseNormalNormal7311560606090Zangoose
49991695717717012ndDeinoDarkDragon526550455038Meloetta Pirouette FormeNormalFighting100128907777128Meloetta Pirouette Forme
49992592703703012ndMega AudinoNormalFairy103601268012650Tornadus Incarnate FormeFlyingFlying791157012580111Tornadus Incarnate Forme
49993728762728101stBunnelbyNormalNormal383638323657DragalgePoisonDragon6575909712344Bunnelby
49994657681681012ndJoltikBugElectric504750575065MienfooFightingFighting458550555065Mienfoo
49995707126707101stReshiramDragonFire10012010015012090HorseaWaterWater304070702560Reshiram
49996589664589101stDrilburGroundGround608540304568TynamoElectricElectric355540454060Drilbur
49997303368368012ndPelipperWaterFlying6050100857065ZangooseNormalNormal7311560606090Zangoose
4999810989109101stVoltorbElectricElectric4030505555100MagnemiteElectricSteel253570955545Voltorb
499999739101stMega Charizard YFireFlying7810478159115100MachopFightingFighting708050353535Mega Charizard Y